PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.0812s0002.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 435aa    MW: 48083.1 Da    PI: 7.256
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.0812s0002.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix66.26.9e-2134123186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          +Wt++e+ aL++a+++++ +lrrg+l++++W++v++++      +g  +s++qC++k+e+l+kry+ +k+++ +r  + ss++ +f  l+
  Cagra.0812s0002.1.p  34 CWTDEETAALVNAYKDKWFALRRGNLRAADWDDVAAAVssssTVGGPPKSAIQCRHKIEKLRKRYRGEKQRSLNRPGKFSSSWELFPILD 123
                          8*************************************988888899****************************9999*******9887 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.4E-2133125No hitNo description
PROSITE profilePS500906.0863595IPR017877Myb-like domain
SMARTSM005950.006142130IPR006578MADF domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
Sequence ? help Back to Top
Protein Sequence    Length: 435 aa     Download sequence    Send to blast
MSIPDDGSPV AMAIDSSTAV TVATTTTRRV PPPCWTDEET AALVNAYKDK WFALRRGNLR  60
AADWDDVAAA VSSSSTVGGP PKSAIQCRHK IEKLRKRYRG EKQRSLNRPG KFSSSWELFP  120
ILDAMGFAPV TPAAVETYDP DVDHDDESNG LDGFRVRSKR SGKFSGGYSD SPRDVGDGYG  180
VRSRSRSNMK LYGGFKSEFD SDHDSGSGFG LKRKYNGNPK VSADFDADSD DEIVLVPKAT  240
RLRTHGKPSS GDFSHSGGGG FPLKSFGDRN FASHGFKPKN FSKTEPNFSQ DLDYDDEFDD  300
DRSEREGFNP RIQSSRSSSR VNGYSRKDGS YPRNTGVSNG YGSSSRFKHE QMNAAAEVES  360
DPIDEVVSSV KMLTEMFVRV ENSKMEMMRE MEKSRMEMEL KHCQMMLESQ QQIIGAFAEA  420
LSEKKSTNAR RPVS*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK1182450.0AK118245.1 Arabidopsis thaliana At4g17060 mRNA for unknown protein, complete cds, clone: RAFL19-55-A07.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006283753.10.0hypothetical protein CARUB_v10004844mg
TrEMBLR0F5C60.0R0F5C6_9BRAS; Uncharacterized protein
STRINGBra040110.1-P0.0(Brassica rapa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM104831828
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G44730.15e-21Trihelix family protein